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An  Estimation  Problem  with  Poisson  Processes 

By 

Alan  E.  Gelfand 


1.  Introduction 

lhis  paper  considers  the  following  problem.  Suppose  we 
observe  a  collection  of  independent  Poisson  processes,  X^t) , 

1  -  1,2,..., a  such  that 

*  x±Ct)  ~  P0(X1(t) 

with 

X±(t)  -  «±p<t)  . 

Let 

t 

p(t)  -  /  p(u)du 

0 

If  all  the  processes  are  observed  up  to  a  fixed  tine  point  T  , 
bow  nay  we  estimate  p(t)?  If  we  assume  p  Integrable  on  C0,°») 
and  without  loss  of  generality  set  p(»)  ■  1  then  each  X^t) 
converges  in  distribution  to  X^(«)  where 

X1(»)  *  P0(6i)  . 

In  section  2  we  formalize  the  mathematical  setting  for  this 
problem.  In  section  3  we  describe  two  estimation  approaches  and 
Illustrate  these  approaches  with  two  choices  of  p  .  Application  of  this 
sort  of  estimation  problem  arises  when  there  Is  Interest  in  the  conditional 
distribution  of  X(t)  given  X(«)  that  Is,  in  the  proportion  of 
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occurrences  of  a  process  which  nay  be  expected  by  tine  t  .  In  section  4 
a  successful  application  of  the  methodology  to  the  active  life  of  a 
judicial  opinion  is  described. 


2.  The  Set  Up 
Suppose 


with 


X(t)  -  P0(MO) 

MO  -  6p(t) 


Suppose  further  that 

(i)  p  is  integrable  with 
t 

p(t)  -  /  p(u)  du 

0 

and  that  lim  p(t)  =  p(«°)  *  1  . 

t  -*■  <*> 


(ii)  p(t)  strictly  decreases  in  t  ,  p(0)  <  » 

(iii)  p'(t),  p”(t)  exist,  t  >  0  . 

Two  particular  examples  of  interest  in  the  sequel  are 
PjCtju)  -  ae~at  ,  o  >  0  with 
Pj(t;a)  -  1  -  e"01 


and 

p,(t;a)  -  — - — =•  ,  a  >  0  with 

(a+t)2 

P2(t;a)  »  t/(a+t)  . 
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In  general  p  and  p  tend  to  look  aa  in  Figures  1  and  2. 


Fig  1:  A  typical  p(t) 


Fig  2:  A  typical  p(t) 


In  view  of  our  assumptions 

X(t)  +  X(»)  where  X(«>)  ~  FQ(6) 

with  interpretation 
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X(t)  ■  #  of  occurrences  of  the  process  by  time  t  . 

X(°°)  a  f  of  occurrences  of  the  process  st  the  end  of  time. 

Why  is  p(t)  of  interest?  The  conditional  distribution  of  X(t) 
given  X(“)  is  binomial  i.e. 

X(t)|x(»)  -  B1(X(«),  p(t)) 
and  therefore  simple  coaputation  shows  that 


E[X£tl 

E[x(“) 

|  X(»)  >  0 

)  ■ 

P(t) 

varf  5!£l 
varl  X(») 

|  X(«)  >  0 

)■ 

c(6)  p(t)(l-p(t»  . 

c(6)  - 

-6  -1 
(1-e  ) 

09 

l 

i  • 

y-i 

y  y* 

Hence  p(t)  measures  the  expected  proportion  of  occurrences  of 
the  process  by  tlae  t  .  An  implication  of  our  assumptions  on  p 

Is  that  for  any  Interval  of  fixed  length,  the  earlier  in  tlae  it  is  placed 
the  greater  the  expected  number  of  occurrences  in  the  Interval. 

The  significance  of  p(t)  is  more  strongly  emphasized  in  the 
case  where  we  have  n  Independent  processes  X^(t)  such  that 

XA(t)  «.  1  •  l,.#«,n 

where 

(1)  X1(t)  -  «tpCt)  . 

In  this  case  we  would  like  to  synthesise  informstion  from  the  n 
processes  to  estimate  the  common  p(t).  In  other  words,  for  sny 
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individual  process  there  would  be  little  value  In  factoring  a  6 
from  the  intensity  function.  However,  for  some  phenomena  that  we  night 
sample  (see  the  application  in  section  4)  the  behaviour  night  be  such  that 
froa  sample  to  sample 

i)  the  absolute  intensity  (i.e.  expected  rate  of  occurrence)  varies 
but 

ii)  the  relative  intensity  (i.e.  ratio  of  expected  rate  of 
occurrence  at  two  different  time  points)  does  not. 

In  such  a  case  the  form  for  X^(t)  in  (1)  is  appropriate.  In  our 
application  we  envision  considerable  variation  in  the  6^  .  For  some 
sampled  processes,  occurrences  will  be  very  sparse  while  for  others  they 
will  be  quite  numerous.  But  again  our  Interest  is  not  in  the  6^  but 
in  p(t)  . 

Suppose  then  that  we  observe  the  n  processes  during  the  interval 
CO.T].  How  may  we  estimate  p(t)  ?  (The  reader  may  be  curious  as  to 
whether  differing  lengths  of  observation  time,  for  differing  sampled 
processes  can  be  accomodated .  We  will  return  to  this  question  shortly.) 
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3,  Estimation  Approaches 

la  attempting  to  estimate  p(t;a)  using  data  in  [0,T3 
we  consider  two  approaches. 

A.  Dse  the  actual  arrival  tines  of  the  occurrences 

B.  Partition  C0,T]  into  intervals  and  record  numbers  of 
arrivals  in  each  interval. 

Clearly  the  latter  approach  results  in  some  loss  of  information  compared 
with  the  former.  We  shall  obtain  a  measure  of  this  loss  and  apply  it 
to  our  p's  of  Interest.  In  applications,  it  may  be  the  case  that 
data  is  gathered  categorically  for  reasons  of  convenience  or  reliability 
that  only  approach  B  is  available.  Bence  an  indication  of  the  loss 
of  Information  while  still  of  interest  may  no  longer  be  relevant. 

Consider  Approach  A.  From  Cox  &  Binkley  (p.15-16)  if 

X(t)  ~  PQ(6p(t))  and  in  the  interval  [0,T]  (ordered)  occurrences 

are  observed  at  w.t  w_, . . . ,v  then  the  unconditional  likelihood 
i  4  n 

becomes 

L(p,d,m,w^,V2» ... 

T 

B  -6 J  p(u)du 

(2)  -  «*  TT  P<V  e  ° 

i-1  1 

The  conditional  likelihood  given  X(t)  -  m  thus  becomes 
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L(p,wltw2,w3, . . .  ,wB|X(T)-a) 

T 

-6  /  p(u)du 

*»!  SmTF  p(w.)  e  ° 

•  •  » 


i-1 


T  -6/  p(u)du/m 

6*L  /  p(u)dtQm  e  0 
0 


(3)  -  a!  TT  p(w.)  (p(T))“*  , 

i-1  1 

free  of  6  . 


For  n  samples  with  occurrences  time  v^,  1  -  l,...,n  , 
j  -  1 . m^  (that  la  X^(T)  -  n^)  ,  Em^  -  a  ,  we  slailarly  obtain 


L(P,61,  m±,  w±J) 


(4) 

■'  (  TT 

i  n 

!  TT 

t  i-1  1  J 

i-i 

and 

*^P,wijlXi/T)  *  *1.  * 

i-1 

"i 

(5) 

-  TT  TT 

TT  p 

i.l 

-(Z6.)pCT) 


lote  that  if  sample  i  is  observed  over  CO.T^]  the  only  change  to  be 

n  -m. 

aade  in  (5)  is  that  (p(T))"11  is  replaced  by  TT  (P(T-))  )  . 

i-1 


Suppose  p  is  of  the  form  p(t;a)  (p  of  the  form  p(t;a)) 
so  that  with  estimation  of  a,  p  is  specified  and  thus  p  is 
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estimated.  Then,  in  (2) 
a  and  6  are 

?— lo8 It  »  y  . 

3a  L 

ULs&Ji  -  “ 


the  unconditional  likelihood  equations  for 

8  log  p(w1;a)  ap(T:a)  . 

5a  3a 

-  P(T;a)  -  0 


whence 


3  log  pCw^a) 

l  - 5a - 


-  m 


3p(T;a) 

3a  ' 


/p(T;a) 


0  . 


In  (3)  this  conditional  likelihood  equation  for  a  is 
3  log  L/m  3  log  p(w.;a)  3  log  p(T;a) 

(7)  - 3a  “  l - 5a  “  * - 55 -  *  0  * 

Hence  as  is  well  known  (e.g.  Cox  and  Lewis*  Chap  3)  the  conditional 
likelihood  equation  for  a  is  the  sane  as  the  unconditional  one. 

We  nay  rewrite  (7)  as 

3  log(p(w  ;a)/p(T;a)) 

(»)  I  - k -  ■  0  • 

In  (8)  the  conditional  nature  of  the  likelihood  equation  is  most 
apparent  i.e.  P(u;a)/p(T;a)  is  the  density  function  for  an  occurrence 
given  it  occurred  by  time  T  .  Thus  if  61  is  a  solution  (8) 
and  the  usual  regularity  conditions  are  satisfied*  <3  will  be 
asymptotically  normal  with  mean  the  true  a  and  asymptotic  variance 
(1(a))”1  where 
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1(a)  -  E 


3  logCpCWjjaJ/pCT 
__ 


(9) 


-  E 


’  3  log  p(W;a)  ■ 

2  ^3  log  p(T;a)' 

da 

•  4 

3a 

Computationally  a  more  convenient  form  may  be 

x 2 

i-OglP1 

3a' 


1(a) 


5-E[ 


3  log(p(W;a)/p(T 
2 


(10)  -  _  E  -9-2  fog  Sfflig)  +  g2  log  p(_T;a)  . 


3a 


3a 


For  the  case  of  n  samples,  a  is  a  solution  to 


(ID 


n  i  3  log  p(W^;a) 
i-1  j=l 


la" 


3  log  p(T;a) 
3a 


from  which  we  see  that  the  separation  of  the  occurrence  times  by  samples  is 

superfluous.  The  total  of  m  occurrence  times  may  be  considered  as 

a  single  sample.  This  may  be  noted  directly  by  considering  the 

n 

occurrence  times  of  the  process  Y(t)  -  Zj  X±(t)  in  [0,T]  . 

Returning  to  (7),  consider  Pi(t;a)  -  ae-at  .  We  obtain 


m  T  e 


-OT 


1  -  e 


-aT 


0 


or 


m  T  e 


-aT 


1-e 


-aT 


and 
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r 


l  _2  -aT 

C12)  1(a)  -  -2  -  --■■^2-  ‘ 

a  (1-e  ) 

(Again  see  Cox  and  Lewis,  p.47). 

2 

For  P2(t;a)  -  o/(a+t)  we  obtain 


or 

v  f  1  }  m(2a4-T) 

*  (  a+w±  J  2a(a+T) 

and 

(13)  1(a)  -  T2/3a2(a+T)2  . 

We  now  consider  approach  B  .  Suppose  we  define  k.  intervals  by 
0  <  tA  <  t2  <  ...  -  T  .  Then  for  one  sample 

X(t1),  X(t2)  -  X(tj)  ,...,  X(tk_1)  -  X(tk_2)|X(T)  =  m 

~  Multinomial  (m,  q^»  q2»****qjt) 


where 

.  ptt3)  -  pcyp 

qj  P(T) 


For  n  samples  let 


j  m  1 » • • • »k 

(to  =  o) 


*1 
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Then  Z  -  (Z^f Z^, . . . is  complete  and  sufficient  given 


i 

X^(T)  -  m^  and  in  fact  is  multinomially  distributed  as  veil  i.e. 

—  n) 

2  -  Multinomial  (m, 

Two  natural  estimation  approaches  arise  at  this  point:  the  method  of 
minimum  chi  square  and  the  method  of  maximum  likelihood  (method  of  moments 
estimation  might  be  considered  as  well) .  The  former  leads  to  consideration 
of  the  function 

k  2 

l  (Z,  -  mq)  /mq 

j-1  2  2  2 

while  the  latter  leads  to  the  (conditional)  likelihood  function 


^(p, Zl • • • • jZ^/m) 


k 

-  TT  i* 

j-i  2 


Again  presuming  p  is  of  the  form  p(t;a), 

likelihood  equation  for  a  is 

3  log  L/m 
3a 


3  log  q^(a) 
3a 


the  resulting 


0 


or 


3  log  (p(t.;a)  -  p(t  . ;a)) 

«*>  l  *3 - hs - - 


m3  log  p(T;a) 
3a 


-  0 


Again  if  fig  (G  for  grouped)  is  a  solution  to  (14),  under  the 
usual  regularity  conditions  fig  will  be  asymptotically  normal  vlth  mean 


where 


the  true  a  end  asymptotic, variance  (I. (a)) 


-1 


I„(a) 


.  G 


Mi 


(15) 


3  log  ^(a) 
3o2 


3  log  q.(a) 

■  -  I  <1  j  (a)  - — ^ - 

3a2 


or  in  terms  of  the  p^(a) 


(16)  (p(T;a))-1J[  (pCt^ja)  -  p(tJ_1;a)) 


-1 


^(pCtji^-pCt^^^a))' 


3a 


2  ^3  log  p(T;o)-|2 

'[  to  J 


The  similarity  between  expressions  (7)  and  (14)  and  between  (9)  and 

(16)  is  not  unexpected.  In  fact,  if  as  k  *►  •  ,  sup  (t  -t  ,)  0 

1  <  j  <  k  3  3~l 

it  is  straightforward  to  show  that  the  "categorical"  expressions  converge 
to  the  corresponding  "continuous"  ones  . 

In  (14),  consider  p^(t;a)  ■  ae”at  .  Since 

we  obtain 

(17) 

and 


V“> 


-“*.1  -“j 

#  j  —  p  j 


1-e 


-aT 


-at. 


t  e  -  t  e 

J  Z  -ti - 1 


-at. 


“j 


"atj-l__‘at:j 


mTe 


-aT 


1-e 


-aT 


-  0 
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(18) 


Ma>  -  I 


•  J  J  (trS-i>  tV* 

-at  -at.  .,,,-01,2  * 

(l-e^Ke  j_1-e  J)  (1"€  > 


Expressions  (17)  and  (18)  can  be  simplified  considerably  if  we  use 
equally  spaced  intervals.  Let  t  -  T/k  whence  t j  ■  jT  .  Then 
(17)  becomes 


£  jZj  -  ad-e^V1  -  m  k  e“°T  (l-e"aT)_1 


and  (18)  becomes 


r2  -at  ,,  -at. -2  _2  -aT.,  -aT.-2 

t  e  (1-e  )  -  T  e  (1-e  ) 


In  (14)  consider  P2(t;a)  -  a/(a+t)  . 


Va) 


a(a+T)  (t.-t-i) 


T  (a+tj)(a+tj_1) 


and  we  obtain 


_zi(tjtj-i-q  l 
a(a+tJ)(a+tJ_1) 


(22)  iG(a) 


j  f_!i_  .  S-i  r’f  h  .  _.Vi  i2  _i_ 

i  Ica+tj)2  («K  (ct+T)2 


Again,  equally  spaced  intervals  simplify  matter  Expression  (21)  becomes 


(2a+2j-l)t) 

(a+jt)(a+(j-l)7) 


m(2cH-T) 
a(a+  T) 


and  (22)  becomes 


<24)  1  l 


(q2-j (j-i)t2)2 
(a+jt)3(a+(j-l)t)3 


(ort-T) 
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We  now  attempt  comparison  of  the  estimation  approaches.  Again 
6  Is  the  MLE  from  (7)  while  fig  is  the  MLE  from  (14). 

We  employ  ideas  similar  to  those  of  Llndley  .  Given  &G  how  may 
we  "correct"  to  fi  .  We  obtain  an  approximation  for  i  H  fig  -  S  . 
Llndley  considers  the  reverse  situation;  from  the  functional  form 
for  the  ungrouped  MLE  he  obtains  a  correction  to  the  grouped  MLE. 

In  our  applications  the  grouped  MLE  is  all  that  is  available.  Llndley 
also  considers  an  l.i.d.  sample  while  our  are  clearly  not  so. 

We  also  seek  to  estimate  the  loss  of  efficiency  due  to  grouping. 
Since  &  is  most  efficient  we  calculate  the  (asymptotic)  relative 
efficiency  of  fi  to  fil  by  l(a)/IG(a)  .  In  fact  the  proportion 
of  information  lost  is  (1(a)  -  IG(a))/I(a)  . 


To  approximate  A  we  assume  equally  spaced  intervals.  Using 
Taylor’s  theorem 


p(jt;a)-  p((j-l)t;a)  wt  P((j-*s)t;a)  +  —  p'HU-^tja) 
whence 

9  log(p(.1t;a)-  p((j-l)I;a))  9  log(p(j-4)~t;a)_  t?  J_ 

9a  *  9a  24  9a 

Hence  using  (14)  and  suppressing  the  arguments  of  p  and 


p"((1-»s)t;a) 

p((j-*5)t;a) 

p  we  have 


a  log  p 
3a 


_L  I  _  q  i  2.1  « 

30  'aG 


0  . 


14 


1 


In  (7),  presume  that  if  «  ((j-l)t,  jt] 

p(w1;a)  w  p((j-%)t;a) 


then 


so  that 


I 


z  3  log  P  _  ■  3  lo&l 


3a 


& 


3a 


a 


To  a  first  approximation,  then 
-2 


(.25) 


-  I?  I2i 


A  ■ 


r  _  3  log  P 

l*  3a2 


m  3  log  p 


d2a 


a. 


Iteration  can  be  employed  to  better  the  approximation.  For 


Pj(t;a)  «  ae  (25)  becomes 


1 

_2  -«GT  1 
T  e 

12 

/o2 

u*r 

For  Pz(t;a)  ■  a/(a+t)  ,  (25)  becomes 


?  >  V?  [ 


A  2  A2 

m  ((aG-W)z+a‘) 
<*G  (SG+T>2 


-  2  l  Z 


>sf 


where  b^  "  +  (J-Wt 
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» 


from  (9)  and  (16) 


For  the  relative  efficiency  of  6L  to  61 

V 


we  obtain 


Eff(ac|a) 


fd  log  pfwja))2 

[3  log  p(T;a)) 

{  3a  j  " 

l  3“  J 

-i  -i  Ta  l2  f 

(p(Tja))  MpCtjjoJ-pftj^jo))  -gj  CpUjScO-pCtj^sa))  -J 


3  log  p(T 
da 


Let  us  consider  our  illustrative  Pj»p2  “sing  equally  spaced 


intervals.  For  ,  we  have 


1  tV" 


(26)  Eff(aJa) 


2  _  2 

°  (l-e~aT' 


“2  -at 
t*  e 


_2  -aT 
T  e 


(l-e"at)2  (l-e“aT)2 


For  fixed 

T  , 

the  following  linits  nay  be  easily  discerned 

i) 

7 

i .  e . 

k  fixed,  a  -*■  0  ,  Eff 

2  2 

-  k*/(k  -1) 

ii) 

t 

i.e. 

k  fixed,  a  -»■  •  ,  Eff 

+  CO 

iii) 

a 

fixed. 

t  +  0  i.e,  k  -+■  00  , 

Eff  -►  1 

iv) 

a 

fixed. 

t  -*•  T  i.e.  k  1  , 

Eff  -*•  • 

Letting 

s  -  aT 

,  with  k  t  -  T  (26) 

becomes 

(27) 

k2  g(s/k)  (g(s)  -  s2) 

S2(g(s)  -  k2  g(s/k)) 

where 

«  _  2j 

g(s)  -  (e8-l)(l-e‘s)  -  ^  ^~yr 
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Table  1  evaluates  (27)  over  a  range  of  s  and  k  values.  For 

T2 

(28)  Eff(aG|a)  - - 3a.2iaj-T>2 - - -  . 

t(a+t)  r  (a2-i(1-l)t2)  1 

aT  u  —  3  —13“  2 

aT  (a+jt)  (a+(j-l)t  )  (a+T Y 


Letting  s  ~  a/T  with  kt  -  T  (28)  becomes 

3sk?fffl)3  Z  — rl)l2 -  3s2  .  1  . 

(sk+j)J(sk+(j-D) 

4 

Table  2  evaluates  (29)  over  a  range  of  s  and  k  values.  The  limiting 
behavior  noted  above  for  (26)  also  occurs  for  (28)  with  "a  -  1/a". 

Regardless  of  what  parametric  form  was  adopted  for  p  and  of  what 
estimation  approach  was  used  the  following  offers  a  simple  way  of  assessing 
fit.  For  any  two  time  points  tj,  t2,  tj  <  t2  ,  X(tj)  |  X(t2>  - 

BjWtj,),  p(t1)/p(t2)).  Hence  (if  X(t2)  >  0)  X(t1>/X(t2)  Is  the 

UMtfU  of  p(tj)/p(t2)  .  For  any  estimate  p(t)  =  p(t,fi)  we  can 
compare  p(tj ;Q) / (p(t2;6l)  with  the  UMVU  ratio. 


(29)  ^ 


17 


4,  An  Application 

One  attractive  application  of  the  above  methodology  Is  to  the 
matter  of  describing  the  active  lifetime  of  a  court  case.  More 
specifically,  associated  with  court  opinions  (particularly  higher  court 
opinions such  as  those  of  the  D.S.  appellate  or  supreme  courts)  is  the 
issue  of  precedential  value.  How  much  effect  will  an  opinion  have  on 
later  opinions?  One  way  to  study  this  point  is  to  record  citations 
of  this  opinion  in  later  cases.  It  is  thus  of  interest  to  examine 
the  pattern  of  subsequent  citation  of  a  case.  It  seems  reasonable 
to  assume  that  subsequent  citations  arrive  according  to  a  Poisson 
process.  Moreover,  although  the  intensity  function  will  vary  greatly 
from  case  to  case  the  expected  proportion  of  citations  to  a  given  time 
need  not.  (For  the  data  to  be  presented,  some  preliminary  empirical 
work  using  fixed  points  t^,  tj  and  comparing  binomials  has 
supported  this  premise.)  That  is,  sparsely  cited  cases  will  receive 
about  the  same  relative  proportion  of  total  cites  by  a  fixed  time  as  will 
heavily  cited  cases.  Our  assumptions  regarding  p(t)  made  in  section 
2  seem  appropriate  in  this  context  since  an  opinion  is  a  bit  like  wine. 

At  first  it  Improves  with  age  (l.e.  is  more  often  cited)  but  then  it 
deteriorates  (i.e.  is  less  often  cited  as  newer  opinion  diminish  its  value). 
Estimation  approach  B  was  forced  in  this  situation.  Data  collection 
recorded  the  date  an  opinion  was  handed  down  along  with  the  dates  of 
subsequent  citation.  It  was  decided  more  convenient  and  reliable  to 
categorize  subsequent  citations  into  intervals  after  decision  date  than 
to  convert  to  the  exact  number  of  days  after  the  decision  date. 
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The  court  cases  studied  in  this  application  are  all  from  the  U.S. 
Court  of  Appeals.  They  are  drawn  from  a  larger  study  (see  Schuchman 
and  Gelfand)  of  the  Federal  appellate  courts.  Two  Independent  samples 

•ere  analysed.-  One  consists  of  100  cases  from  the  Fifth  circuit 
court  of  appeals  while  the  other  consists  of  100  cases  from  the 
rasaining  ten  circuits.  In  each  sampled  case  the  date  of  decision  was 
recorded  (all  dates  are  between  i960  and  1962) .  By  use  of  Shepard'*' 
Index  we  may  find  all  later  cases  citing  any  sampled  case.  By 
these  citing  opinions,  we  may  obtain  the  total  number  of  citations  for 
any  sampled  case  and  a  categorical  time  from  date  of  decision  to  each 
such  citaticn.  Subsequent  citation  of  each  case  in  «arb  sample  was 
considered  through  December  1976  thus  achieving  a  nmm  of  13  full 
years  of  obsauvrtion  with  numerous  cases  reaching  to  nearly  16  full 
years. 

4 

For  convenience  we  set  T  •  12  and  consider  t  »  2,  k  «  6  . 

The  resulting  z  vectors  are  given  in  Table  3.  Note  that  the 
fifth  circuit  opinions  are  more  frequently  cited  than  their  other 
circuit  counterparts.  Furthermore,  we  find  an  inexplicable,  but 
common,  bump  in  both  samples  occurring  at  Z  ^  ,  i.e.  for  the  period 
between  6  and  8  years  after  date  of  decision.  A  test  of  homogeneity 
between  the  samples  yields  x  *1.290  with  descriptive  level  -  .93  . 

Table  A  presents  the  estimation  of  a  using  both  p^(t;a)  and 
P2(t;a)  .  We  see  that 
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(I)  Both  functions  Pj  and  p2  fit  the  data  well 

(II)  Estimation  by  MLE  and  MMx2  yield  very  similar  estimates  of  a 

(III)  The  estimates  of  a  for  the  two  samples  are  fairly  close 

'  (as  expected  from  the  test  of  homogeneity) 

(iv)  For  the  5th  circuit  data  using  p^  and  ~  .09, A  *  0.0027 

while  for  p2  and  A  ■  1.038. 

(v)  From  Tables  1  and  2  the  relative  efficiency  for  p^  under  the  given 
grouping  is  1.03,  for  p2>  1.04. 

In  Table  5  we  compare  the  fits  of  and  p2  with  the  "best" 

(i.e.  UMVU)  estimate  as  described  at  the  end  of  the  previous  section. 

He  look  at  t^  relative  to  t^+j  ,  t .  •  it  .  Again  the  fits 

look  quite  good  l.e.  the  maximum  relative  error  for  the  exponential 
model  is  .03,  for  the  bilinear  model  .04. 

How  can  it  be  that  two  such  different  functions  Pj  and  p2 

fit  the  data  almost  Identically?  Table  6  clarifies  the  matter. 

It  also  provides  representative  values  of  the  function  over  time 
using  a  *  .09  in  p^  and  a  ■  18  in  p2  .  We  see  that 
during  the  time  interval  t  -  2  through  12  years  p2(t)/p2(t) 
is  virtually  constant.  Since  the  assessment  of  fit  Involves  p(t) 
in  a  relative  way,  over  the  12  years  of  observation  and  p2 

will  be  virtually  indistinguishable.  However  in  an  absolute  sense 

p^  is  preferred  to  p2 .  This  follows  by  asking  how  many  years 

we  expect  to  wait  for  a  specified  proportion  of  subsequent  citations 
e.g.  what  is  t  such  that  p^(t)  ■  .9  ,  such  that  p2(t)  “  .9  ? 
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For  Pj  the  answer  Is  25.6  years,  for  p2,  162  years.  Additionally 
C33.3)  “  .95  and  p2(51.1)  ■  .99  .  These  proportions  are 
reasonable  both  empirically  and  Intuitively.  Thus  an  exponential 
Intensity  function  seems  an  effective  descriptor  for  subsequent  citation 
of  an  opinion. 
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Table  1:  Relative  Efficiency  for  p^(t;ct)  »  i_e’"at 


2  4  6  10  20  50 


.01 

.05 

.1 

.5 

1 

5 

10 


34.337 

7.683 

4.364 

1.778 

1.500 

1.344 

1.336 


11.896 

3.034 

1.951 

1.170 

1.102 

1.067 

1,067 


7.412 

2.119 

1.491 

1.075 

1.044 

1.303 

1.029 


4.428 

1.529 

1.210 

1.027 

1.016 

1.010 

1.010 


2.480 

1.178 

1.061 

1.007 

1.004 

1.003 

1.003 


1.445 

1.035 

1.010 

1.001 

1.001 

1 

1 


Table  2:  Relative  Efficiency  for  {^(tja)  •  t/(a+t) 
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Saaple  Zx  &2  Z3  Z 4  Z$  Z 


Fifth  Circuit 

225 

191 

145 

152 

112 

85 

Other  Circuits 

182 

144 

114 

123 

84 

78 

Table  3  :  The 

Fifth 

Circuit 

and  Other 

Circuit  Data 
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p, (t;a)  I  Fifth  Circuit  |  .0893  |  .0888  |  4.679(.32) 
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Examining  the  Exponential  &  Bilinear  Fits. 
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